Validity Measure of Cluster Based On the Intra-Cluster and Inter-Cluster Distance

نویسنده

  • Rahul Malik
چکیده

The k-means method has been shown to be effective in producing good clustering results for many practical applications. However, a direct algorithm of k-means method requires time proportional to the product of number of patterns and number of clusters per iteration. This is computationally very expensive especially for large datasets. The main disadvantage of the k-means algorithm is that the number of clusters, K, must be supplied as a parameter. In this paper we present a simple validity measure based on the intra-cluster and inter-cluster distance measures which allows the number of clusters to be determined automatically. The basic procedure involves producing all the segmented dataset for 2 clusters up to Kmaxclusters, where Kmaxrepresents an upper limit on the number of clusters. Then our validity measure is calculated to determine which is the best clustering by finding the minimum value for our measure.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Biochemical and bioassay studies on the influence of different organic manures on the growth of Mulberry Variety V1 and silkworm, Bombyx mori Linn.

The genetic variation and diversity among fifty-eight polyvoltine silkworm genotypes was estimated by using ten economic traits. The results revealed that the single shell weight showed higher genetic variation such as PCV% (17.20%), GCV% (12.93%), and heritability (56.5%) followed by single cocoon weight, shell ratio and matured larval weight. The D2 (Mahalonobis? distance) statistics reveal...

متن کامل

An analysis of genetic variation and divergence in Indian tropical polyvoltine silkworm (Bombyx mori L.) genotypes

The genetic variation and diversity among fifty-eight polyvoltine silkworm genotypes was estimated by using ten economic traits. The results revealed that the single shell weight showed higher genetic variation such as PCV% (17.20%), GCV% (12.93%), and heritability (56.5%) followed by single cocoon weight, shell ratio and matured larval weight. The D2 (Mahalonobis? distance) statistics reveal...

متن کامل

A kernelized fuzzy c-means algorithm for automatic magnetic resonance image segmentation

In this paper, we present alternative Kernelized FCM algorithms (KFCM) that could improve magnetic resonance imaging (MRI) segmentation. Then we implement the proposed KFCM method with considering some spatial constraints on the objective function. The algorithms incorporate spatial information into the membership function and the validity procedure for clustering. We use the intra-cluster dist...

متن کامل

ارزیابی روش‌های گروه‌بندی ژنوتیپ های کلزا با استفاده از تجزیه تابع تشخیص خطی فیشر

Discrimination function analysis is a method of multivariate analysis that can be used for determination of validity in cluster analysis. In this study, Fisher’s linear discrimination function analysis was used to evaluate the results from different methods of cluster analysis (i.e. different distance criteria, different cluster procedures, standardized and un-standardized data). Furthermore, H...

متن کامل

ارزیابی روش‌های گروه‌بندی ژنوتیپ های کلزا با استفاده از تجزیه تابع تشخیص خطی فیشر

Discrimination function analysis is a method of multivariate analysis that can be used for determination of validity in cluster analysis. In this study, Fisher’s linear discrimination function analysis was used to evaluate the results from different methods of cluster analysis (i.e. different distance criteria, different cluster procedures, standardized and un-standardized data). Furthermore, H...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012